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We study the time dependent cross correlations of stock returns, i.e. we measure the correlation 
as the function of the time shift between pairs of stock return time series using tick-by-tick data. 
We find a weak but significant effect showing that in many cases the maximum correlation appears 
at nonzero time shift indicating directions of influence between the companies. Due to the weakness 
of this effect and the shortness of the characteristic time (of the order of a few minutes) our findings 
are compatible with market efficiency. The interaction of companies defines a directed network of 
influence. 
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I. INTRODUCTION 



In the risk minimisation of portfolio optimisation it is very important to consider how the returns of different 
companies correlate with each other. For this purpose the study of the equal time cross correlations between stocks 
has attracted much interest jj], [^]: The clustering properties and the comparison between the time and ensemble 
averages have provided much useful information in this respect. 

In statistical physics time dependent correlations are also of major interest. Due to their role in the fluctuation 
dissipation theorem they constitute as the main tool for determining transport coefficients. The famous Onsager reci- 
procity relations have their roots in the symmetry properties of the time-dependent cross correlations j| . Obviously, 
in an economic system there is no reason to assume that the time reversal symmetry or detailed balance is maintained. 
Nevertheless, it is of interest to investigate the time dependent cross correlations between stock returns because they 
contain information about the way how the prices influence each other, which are the dominant stocks and to what 
extent this dominance is reflected in the price changes under the conditions of an efficient market. 

The time dependent correlations between the indices of different stock exchanges were already studied empirically 
by 0, H and with a microscopic model by H. They showed that there exists a time shift in the cross-correlations 
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which arises from the fact that the different stock markets are open in different time cycles during the day as the 
\ Earth rotates. 

In this article we study time-dependent cross correlation functions of the returns of different stocks taken from the 
New York Stock Exchange (NYSE). As we will show, in many cases the maximum of the correlation as a function 
of time is not at zero but shifted, meaning that there exists some "pulling" effect between the companies, i.e., one 
of them influences the price behaviour of the others. However, this effect cannot be strong and the shift should be 
small, otherwise the effect could be utilized for arbitrage purposes, which is excluded from an efficient market. In 
fact, the investigated cases do not contradict with these criteria. The time dependent correlation between the stocks 
was studied before by |^]. The results of seems to contradict with ours because they studied weakly returns, 
and they found significant cross-correlations on the weakly scale. However, the results agree considering the "pulling" 
O effect, namely that the cross-correlations are asymmetric. 

The paper is organized as follows: In the next Section we give a short description of the data set. In Section 3 
we present the method of analysis and show how it works on an artificial set of data. Section 4 is devoted to the 
presentation of the results. The paper terminates with a discussion. 



II. DATA 



One of the stylized facts of markets is that the auto-correlation of stock returns decays exponentially with a very 
short characteristic correlation time, which is in the range of a few minutes P, This is understood as a signature 



of market efficiency II . Since cross correlations could also be used for arbitrage, one should not expect effects much 
beyond the above scale and therefore high frequency data are needed. We have analyzed the Trade and Quote (TAQ) 
database for N = 54 days over the time period from 01.12.1997 to 09.03.1998, which includes tick-by-tick data for 
about 10000 companies. Since this is quite a short time period we selected only those companies which were traded 
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more than 15000 times such that the number of companies reduced to 195. 

Having these 195 time series we have to face the following problem: since the tradings do not happen simultaneously, 
the values of the returns have also to be defined for the time intervals between the tradings. According to the rules of 
the stock exchanges we have considered the price as constant between two changes. The whole trading time T during 
one day is divided into n small intervals or windows of size At = T/n. If the trading happens in the interval t the 
return takes the value 

M h &>(*)] 
rAt{t) ~ 



In [p(t - At)} 



where for simplicity the day index i is not indicated; otherwise it is zero. 

In order to avoid the problem of major return values stemming from the differences between opening and previous 
day's closing prices we simply took the days as independent, i.e., the averaging is separated into two steps: Over the 
intraday trading time T and over the trading days. The data prepared in this way were then analysed from the point 
of view of time-dependent cross correlations. 



III. METHOD OF MEASURING THE CORRELATION 



As mentioned in the introduction we want to investigate the correlation of returns as the function of the time shift 
between pairs of stocks' return time series. The definition of the time-dependent correlation function Ca,b(j) is 



y-iA,B r \ (ri t (t)rl t (t + T))-(r£ t (t))(rl t (t + T)) 

C At ( T ) = Z~~Z ! W 



where a 1 — (^(rAt(t) — (^At(O)) 2 ^ i s the variance of the return. The notation (.) means averaging over the whole 
trading time T and important details of this process will be given in the following. 

Since the smallest interval between two tradings is one second, then At = Is seems to be a natural choice. However, 
for such a short window it quite often happens that at a given time step there is no transaction for one of the stocks 
(or for both) such that the return results in a zero contribution to the total correlations. Since the number of non-zero 
contributions is small, the correlation coefficients as a function of the time shift, r, will strongly fluctuate. To avoid 
this problem one has to enlarge the time difference, At, and average the correlations over the starting points of the 
returns. In this way the average in Eq. (|l|) means the following 

1 At-l T/ At 

<r£ t (t)rf t (t + r)) = - £ £ r&fo, + kAt)r% t (t + kAt + r) , (2) 

t =0 k=l 

where the first sum runs over the starting points of the returns and the second one runs over the At wide windows of 
the returns. 

In order to illustrate the effect that by taking larger time difference it is easier to identify the peaks in the correlation 
function - in other words to locate the time which gives the maximal correlation - we simulated two series of artificial 
data sets. The first is a one dimensional Persistent Random Walk (PRW) p2| , which deviates from a normal RW by 
the fact that the probability, a, that it jumps in the same direction as in the previous step is higher than 0.5, i.e., 
the random walker remembers its history. The probability of an increment x(t) £ {±1} at time t is 

P (x(t)) = a8 x ^ t _x) + (1 - a)(l - 5 x , x (t_i)). (3) 

The other time process is simply generated from the first one by shifting it by To and adding to it Gaussian random 
noise with zero mean and width a: 

y(t)=x(t-T )+£(t), teN(Q,a) (4) 

The advantage of this model is that the correlation function can be calculated analytically and the position of the 
maximum correlations can be adjusted at To: 

\Jo + 1 

After generating the two data sets we randomly drop points from both sets and keep only the fraction p of the points 
in order to have the same problem as with the original data sets that the jumps do not occur at the same time in the 
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FIG. 1: Illustration how the correlation as the function of the time shift, r depends on the time deference, At of the return. 
The correlation was measured on the artificial data sets with parameters: p = 0.01, to = 200, a — 1000, a — 0.99. The figures 
shows that while for At = 1 no peak in the correlation function can be identified, by increasing the time difference to At = 10 
the peak at r = 200 appears. 



different time series. It is apparent from Figjlj. that increasing the time difference, At helps identifying the time of 
maximum correlation. The fact that we dropped random points from the original data changes slightly the position 
of the maximum correlation as compared to Eq. (|^) . 

Fig.[j]. shows that the decay of the correlation function is not exponential as in Eq.(|^) but it decays approximately 
linearly down to the noise level. This is due to the averaging procedure we use with the increased time difference, At. 
The correlation corresponding to larger time difference Ca*(t) can be written as the weighted sum of the one-step 
correlation functions, Ci, which belongs to At — Is: 
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(6) 



IC x {t- At+1) 
+ At Ci(t) + .. 



.. + (At-l) Ci(r-l) 
1Ci(t + At - 1) , 



where 6 — r / \ t =i is the return belonging to one second time difference. 

Changing t in Eq.(^|) means changing the weights of the one-step correlation functions. Since the correlation function 
of the original data sets, see Eq.(|^), decays exponentially, the maximum, C\(to), will give the main contribution to 
the sum in Eq.(^) and because its weight is linear in r, then Ca*(t) will decay approximately linearly. (It should be 
noted that the normalization factor in Eq.(|l|) does not change this consideration since it is independent of r.) 

There is only one question left namely how can we choose a smaller value for r than for At? The time dependent 
cross correlation of the returns contains a product of the return of company A with that of company B shifted by r. 
As the return is defined with the window At the values of r could only be multiples of At. The solution is simply 
that one shifts the starting point of the return of company B by r, as evident in Eq. (Q), i.e. we make the time shift 
in the price function and in this way allow any time shift larger than the minimum trading time. 

The above arguments of averaging give support to choose a value for At that is larger than the minimum trading 
time. However, it should not be too large since the averaging leads then to the smearing out of the maximum. As the 
width of the one-step correlation should be a few minutes, much larger time difference would mean that in the sum 
of Eq.(||) we mainly have terms, which are only due to noise. This suggests that the optimal choice for At is of the 
order of magnitude of one minute. 



IV. RESULTS 



As mentioned in Sec. II we have studied the correlation of 195 companies, which were traded during the available 
54 days more than 15000 times. In accordance with the arguments presented in the previous Section we have used 
At = 100 but checked that the results are quite robust within the range 50 < At < 500. As already mentioned, 
we averaged over the starting points of the returns. For the maximum of the time shift we choose 2000s. This is 
definitely beyond any reasonable characteristic time for correlations in return values because of market efficiency. In 
fact, using such a large value for the time shift allows us to measure the noise level, which the possible effect should 
be compared with. 
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FIG. 2: Example for the measured shifted-time correlation function. The two companies are: Ensco International (ESV) and 
Exxon Corp. (XON). The maximum correlation value is at -100 sec, which means that the return time series of ESV has to be 
shifted back in order to get the maximal correlation i.e. the price changes happen later in time, in other words ESV is pulled 
by XON. 




For the resulting 195*194/2 correlation functions we measured the maximum value, C max , the position r max at 
which time shift this maximum was found, and the ratio, R, of the maximum and strength of the noise defined as the 
variance of the correlation values for time shift values between 600 and 2000s. We looked at those pairs of companies 
for which these three values exceeded a prescribed threshold values, which we defined for At = 100 as: r max > 100, 
Cmax > 0.04, R > 6.0. One example of the measured correlation function can be seen in Fig. ||. In this case the 
company XON (Exxon) - which is a large oil company - "pulls" the ESV (Ensco International) which provides drilling 
service to oil and gas companies. This effect is quite weak but the large value of R shows that it is significant. 

The maximal value of the correlations turn out to be quite small, in average less than 0.1, (e.g. see Fig. |]), although 
the generally quoted equal time cross correlations have much larger values. The root of this effect lies in the choice 
of the time difference, At. Increasing At increases the values of the equal time correlations ]l3| ]. 

In some cases the position of the maximum correlation was found at values much larger than few minutes which 
would be inconsistent with the efficient market behavior. A closer inspection revealed that in such cases the peak 
in the correlation function is caused by two major return values in the considered time series. The contribution of 
their product to the correlation - at appropriate value of the time shift - dominates the maximum of the correlation 
function. These are not the effects we are looking for, therefore we did not take them into account. In order to check 
whether the peak in the correlation is due to some single large return value or due to persistent influence of one of the 
stocks on the other we also studied how the correlation changes if the analyzed time window changes. We measured 
the shifted time correlation also for the first and for the second half of the given 3 month period and studied whether 
the correlation function remains qualitatively the same. 

We also measured the correlation for shorter and for larger time difference, i.e. At = 50 and At = 200, respectively, 
because it may happen that by changing the time difference also the position of the maximal correlation value changes 
due to the averaging procedure described in Eq.(j6|). This can happen if the time dependent correlation function for 
At has an asymmetric peak; see Fig. ||. Let us suppose that the left hand side is higher than the right hand one. For 
At' > At the maximum will be shifted towards left as it can be shown through simple examples using Eq.(§). In the 
case of Fig. ^ the correlation function is also asymmetric but not at its peak (not near the maximum), which means 
that the maximum will not be shifted by increasing the time difference, At. 

The results show that the characteristic time shift is around 100 sec. which is consistent with the effective market 
hypothesis. A time shift larger than the characteristic time of the decay of the return auto-correlations would 
contradict with the efficient market picture and could be used to arbitrage. 

In general the more frequently traded companies are influencing ("pulling") the less frequently traded ones. This is 
not surprising since obviously the more frequently traded companies are more important. It is therefore more likely 
that they influence a smaller company than the other way around. Although this is the generic situation, there are a 
few exceptions when a less often traded company "pulls" the other one. 

In this study we found that in general one "small" company is influenced by many "large" companies and one 
"large" company pulls many "small" ones. As can be seen in Fig. ^ this behaviour can be represented as a graph of 
directed links, where there are nodes from which many links go out (meaning that this node is influenced by many 
others) and there are other nodes where many links go in (these are the big companies influencing the less important 
ones). 
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FIG. 3: Example for a pair of company for which the correlation function has an asymmetric peak. The curve with circles 
belongs to At — 100, the other with the squares to At = 500. The maximum of the second curve is at smaller time value 
because the left side of the peak - in the case of the curve with circles - is higher. 
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FIG. 4: Representation of the pulling effect between the companies. The direction of the arrows show which company is 
pulling the other. The companies which appear in the figure show the most significant effects. 



V. DISCUSSION 



In this paper we have analyzed the time dependent cross correlation functions of the returns of stocks at the NYSE. 
We have studied whether there exists any pulling effect between stocks, i.e., whether at a given time the return value 
of one stock influences that of another stock at a different time. 

In general we can see two types of mechanisms to generate significant correlation between two stocks: 

(i) Some external effect (e.g. economic, political news etc.) that influences both stock prices simultaneously. In 
this case the change for both prices appears at the same time, and the maximum of the correlation is at zero 
time shift. 

(ii) One of the companies has an influence to the other (e.g. one of the company's operation depends on the other.). 
In this case the price change of the influenced stock appears later in time because it needs some time to react 
on the price change of the first stock, in other words one of the stocks pulls the other. This pulling effect has 
been the main focus of our study in this article. 

Since the correlation between stocks was expected to be small and the available set of data was somewhat limited 
we had to do a careful analysis. For this reason and test purposes we generated an artificial data set with which we 
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showed that by increasing the time window of the returns and by averaging over their starting points the detection 
of the correlation effect gets easier. 

With the real data we saw that it is possible to find pairs of stocks where the pulling effect exists, though it turned 
out to be small. In addition, the characteristic time shift - given by the position of maximum correlation - was found 
to be of the order of a few minutes. These findings are compatible with the efficient market picture. 

As for the pulling effect we found that generically the more traded, and thus more important companies pull the 
relatively smaller companies. This result is consistent with that of Q|. In this light it is not surprising that in the 
study of the time-dependent cross correlation functions of pairs of companies in the Dow Jones Industrial Average 
index no pulling effect was found. This underlines the fact that the Dow Jones companies are indeed among the most 
important stocks of the New York Stock Exchange. 

Finally we would like to propose that although the observed pulling effect was small, our careful analysis could 
show that it is significant for a considerable set of pairs of companies. We think that this property of the stock market 
should be added to the so called stylized facts. Of course, further analysis on more extensive data is needed to clarify 
further details of the time dependent cross correlations. 
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APPENDIX A: COMPANY NAMES AND DESCRIPTION 



Symbol Name 

ARC Atlantic Richfield Co. 

AUD Automatic Data Processing 

BLS Bellsouth Corp. 

BMY Bristol-Myers Squibb Co. 

CCI Citicorp 

CD Cendant Corp. 

CDG Cliffs Drilling Co. 

CHV Chevron Corp. 

CPQ Compaq Computer Corp. 

DIS Walt Disney Co. 

EK Eastman Kodak Co. 

ESV Ensco International Inc. 

EVI Energy Ventures Inc. 

FCN First Chicago NBD Corp. 

FLC Falcon Drilling Co. Inc. 

GE General Electric Co. 

GLM Global Marine Inc. 

GLX Glaxo Wellcome Pic. 

GTW Gateway 2000 Inc. 



Description 

Petroleum refining 

Data communications and information services. 
Telephone communication 
Pharmaceutical preparations 
Banking. 

Travel, real estate, vehicle, and financial services 

International drilling company 

Energy and chemical company, petroleum refining 

Electronic computers 

Entertainment company 

Photography 

Drilling oil and gas wells 

Oil and gas field machinery 

Banking 

Marine-based drilling 
Electronics, machinery 
Drilling oil and gas wells 
Pharmaceutical preparations 
Electronic computers 



TT A T 

HAL 


TT 11*1, i_ /"I 

Halliburton Co. 


Oil held services 


HD 


Home Depot Inc. 


Home improvement retailer 


HWP 


Hewlett-Packard Co. 


Computers 


IBM 


T 1 1 * 1 TV • AT" 1 ' /~A 

International Business Machines Corp. 


Computers 


IP 


International Paper Co. 


Paper 


JNJ 


Johnson & Johnson 


Health care products 


JPM 


"TV IT 1 1 V T 

Morgan J. P. Co. Inc. 


Banking 


KO 


Coca-Cola Co. 


Solt drmks 


LEH 


T 1 I V i 1 TT 1 1 * 

Lehman Brothers Holdings 


Financial services 


MOB 


Mobil Corp. 


Petroleum refining 


ML) 1 


Motorola Inc. 


Semiconductor technology 


MRK 


"TV IT 10 /"I T 

Merck & Co Inc. 


Pharmaceutical preparations 


MU 


TV IT' m 11 T 

Micron lechnology Inc. 


Semiconductor technology 


NE 


AT 11 T — \ '11* /"I 

Noble Drilling Corp. 


Drilling oil and gas wells 


NN 


AT 1 * 1 AT J_ 1 f 1 

Newbridge Networks Corp. 


rTT 11 1 i_ 1 1 j_ 

lelephone and telegraph apparatus 


NOKA 


Nokia Corp. 


Mobile phones 


PFE 


Pfizer Inc. 


Pharmaceutical preparations 


PG 


Procter & Gamble Co. 


O J i_l J a. j_ 

Soap and other detergents 


RD 


T~V 1 "TA i 1 T "V j 1 /^N 

Royal Dutch Petroleum Comp. 


Petroleum Refining 


SBH 


Smithkline Pic 


Pharmaceutical preparations 


SLB 


Oil 1 T**j_1T rnTv 

benmmberger Limited LID 


Oil and gas field services 


CTTT) 

SUB 


o ' x- t~> i 

Summit Bank Corp. 


Banking 


1BR 


Telecomunicacoes Brasileiras S.A. 


lelecommumcations 


TER 


Teradyne Inc. 


Electrical instruments 


TMX 


m IP l a /r * 

Iclclonos de Mexico 


Telephone communication 


TRV 


Travelers Group Inc. 


Fire, marine and casualty insurance 


UAL 


UAL Corp. 


Air transportation 


VKL 


Varco International Inc. 


Oil and gas field services 


WDC 


Western Digital Corp. 


Computer storage devices 


WLA 


Warner Lambert Co. 


Pharmaceutical preparations 


WMT 


Wal-Mart Stores Inc. 


Retail - Variety stores 


XON 


Exxon Corp. 


Petroleum refining 



